A Projective Simulation Scheme for a Partially-Observable Multi-Agent Game

نویسنده

  • Rasoul Kheiri
چکیده

We develop a two-defender (Alice and Bob) invasion game using the projective simulation method as an embodied model for artificial intelligence. We hope that it can be the first step towards the effect of perception on different actions in a given game. As a given perception of a given situation, the agent, say Alice, encounters some attack symbols coming from the right attacker, where she can learn to prevent. However, some of these percepts are invisible for her. Instead, she perceives some other signs that are related to her partner’s (Bob) task. We elaborate on an example in which an agent perceives an equal portion of percepts from both attackers. Alice can choose to concentrate on her job, though she loses some attacks. Alternatively, she can have some sort of cooperation with Bob to get and give help. It follows that the maximum blocking efficiency in concentration is just the minimum blocking efficiency in cooperation. Furthermore, Alice would have a choice to select two different forgetting factors for blocking attacks and for helping task. Therefore, she can choose between herself and the other. Consequently, selfishness is discerned as an only Nash equilibrium in this game. It is a pure strategy and Pareto optimal containing Shapley value in this superadditive coalition. Finally, we propose another perception for the same situation that can be tracked in the future based on the present study.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

A reinforcement learning scheme for a multi-agent card game with Monte Carlo state estimation

This article presents the state estimation method based on Monte Carlo sampling in a partially observable situation. We formulate an automatic strategy acquisition problem for the multi-agent card game “Hearts” as a reinforcement learning (RL) problem. Since there are often a lot of unobservable cards in this game, RL is dealt with in the framework of a partially observable Markov decision proc...

متن کامل

A reinforcement learning scheme for a multi-agent card game

We formulate an automatic strategy acquisition problem for the multi-agent card game “Hearts” as a reinforcement learning (RL) problem. Since there are often a lot of unobservable cards in this game, RL is approximately dealt with in the framework of a partially observable Markov decision process (POMDP). This article presents a POMDP-RL method based on estimation of unobservable state variable...

متن کامل

Partially observable stochastic game-based multi-agent prediction markets

We present a novel representation of the prediction market using a partially observable stochastic game with information (POSGI), that can be used by each trading agent to precisely calculate the state of the market. We then propose that a correlated equilibrium (CE) strategy can be used by the agents to dynamically calculate the prices at which they should trade securities in the prediction ma...

متن کامل

Model-Based Reinforcement Learning for Partially Observable Games with Sampling-Based State Estimation

Games constitute a challenging domain of reinforcement learning (RL) for acquiring strategies because many of them include multiple players and many unobservable variables in a large state space. The difficulty of solving such realistic multiagent problems with partial observability arises mainly from the fact that the computational cost for the estimation and prediction in the whole state spac...

متن کامل

Game-Theoretic Agent Programming in Golog Under Partial Observability

In this paper, we present the agent programming language POGTGolog, which is a combination of explicit agent programming in Golog with game-theoretic multi-agent planning in a special kind of partially observable stochastic games (POSGs). It is a generalization of the agent programming language GTGolog by partial observability. The approach allows for partially specifying a high-level control p...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2016